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A Symmetric Continuous Poker Model 

A. J. Goldman and J. J. Stone ' 

(August 18, 1959) 

Beginning with von Neumann, mathematicians concerned with the rational analysis of 
conflict situations have realized that investigation of accurate or simplified versions of com- 
mon card games leads to techniques and insights applicable to situations of military or 
economic interest. In the present paper, a symmetric poker model one stage more compli- 
cated than the original von Neumann game is solved. There are two bet levels, a, b, and 
an ante of 1 unit (a>6>l); no raises are permitted. The game has a unique optimal 
strategy, which forbids bluffing on a low hand. The limiting case b== 1 is shown to yield 
the von Neumann model. 



1. Introduction 

The technical literature of the mathematical theory 
of games of strategy can be roughly subdivided into 
(i) general papers, which deal with concepts, existence 
theorems and computational methods relevant for 
broad classes of games, and {ii) specific papers, in 
which the solutions of particular games are deter- 
mined. Although many of the studies in the second 
category have dealt with problems suggested by eco- 
nomic or military conflicts, e.g. [1, 2] 2 } a substantial 
fraction have dealt with accurate or simplified ver- 
sions of common card games, the best known example 
being von Neumann's analysis of a simple poker 
model ([3], sec. 19.4 to 19.10). Bridge and baccarat 
have been treated [/+, 5], but poker has remained the 
favorite topic for these studies [3, 6 to 10], perhaps 
because the poker models investigated have pre- 
sented just the right degree of challenge ("not easy, 
but not impossibly hard") to the ingenuity of the 
analyst. Of course the techniques and insights de- 
veloped in these investigations are then applicable to 
situations of greater practical importance. 

The present paper deals with a symmetric poker 
game one stage more complicated than von 
Neumann's original model. The game is continuous 
in the sense that the difficulties [6] due to the enor- 
mous but finite number of possible hands have been 
avoided by assuming (as von Neumann did) an 
infinite continuum of possible hands. The (more 
difficult) asymmetric version has also been solved 
by the authors; the results will appear elsewhere [11]. 

2. Description of the Game 

The rules of the game are as follows. The two 
players R and S first ante 1 unit. They then receive 
independent random numbers (hands) from the in- 
terval [0, 1], Each player knows his own hand, but 
not that of his opponent. The players act simul- 
taneously, each either dropping, betting an additional 



> Present address, Stanford University. 

2 Italicized figures in brackets indicate the literature references at the end of 
this paper. 



6 — 1 units, or betting an additional a— 1 units. If 
both drop, then play ends and no payment is made. 
If both bet 6—1 units or both bet a— 1 units, then 
play ends and the player with the higher hand 3 wins 
the ante and bet of his opponent. If one player 
bets a— I units and the other bets 6—1, then the 
latter can either jold (in which case he loses his ante 
and previous bet to his opponent) or see 4 by betting 
an additional a—b units (in which case the player 
with the higher hand wins the ante and previous 
bet or bets of his opponent); in either case play ends 
Naturally we assume a>6>l. 

Each player has four courses of action for any given 
hand. These are (1) to drop, (2) to bet 6-1 units 
with the intention of folding if the opponent bets 
a-1 units, (3) to bet 6-1 units with the intention 
of seeing if the opponent bets a-1 units, and (4) 
to bet a-1 units. Figures 1 and 2 show the algebraic 
payoff from player S to player R for any choices 
of courses of action by the players. 

A strategy (for either player) is a quadruple 
r(z) = {Ti(z)} (i= 1,2,3,4) of functions defined for 
0<2<1. Since r t (z) represents the probability 5 
that the player will choose the iih course of action 
when his hand is z, the restrictions 

r t (z)>0, r l (z)+r 2 (z)+r 3 (z)+r i (z) = l 

are imposed. The "smoothness" conditions on these 
functions are (i) that they be integrable, (ii) that 
if r*(s')>0 then, for any interval J of which z' is 
an interior 6 point, 



x 



r ( (t)dt>0, 



and finally 7 (Hi) that 



j: 



[br 2 (t)-arS)]dt=0 on [(& — lj/6, (a—\)ja\ 



3 The possibility of equal hands has zero probability and so can be disregarded. 

4 This possibility distinguishes our model from the case m='i of the version 
studied in part II of [6]. 

5 More precisely, n(z) is a probability density function. 

s The words "interior" and "open" are to be understood relative to the interval 
[0,1]. 

7 This very weak but ad hoc condition is precisely what is needed to prove 
(b) and (c) of lemma 8. 
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implies 

br 2 (z)-ar 4 (z)=0 on [(6 — 1)/6, (a-l)/a] 
It is convenient to define, for any r(z), 



Ri= J r t (t)dt. 



For any r(z), Gi(z) will denote the expected payoff 
to player R, using r(z), against a fixed hand z and a 
fixed course of action i of his opponent. Thus the 
expected payoff function of the game is given by 

K(r,r*)=£ J2r i ^(z)G i (z)dz (2.1) 

The explicit formulas 8 for the G t (z) are readily 
found from figures 1 and 2 : 
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Figure 1 

R has the higher hand. 



Figure 2 

£ has the higher hand . 



G l {z)=\\r 2 {t)+n{t)+r i {t)]dt 

+j\r2(t)+r 3 (t)+ ri (t)]dt, (2.2) 

6 2 (z)=r[-r 1 (t)-br 2 (t)-br 3 (t)+br i (t)]dt 

+ (\-r 1 (t)+br 2 (t)+br 3 (t)+br i (t)]dt > (2.3) 

G i (z)=('[~r 1 {t)-br 2 (t)-bn(t)-ar i (t)]dt 

+£[-r l (t)+br 2 (t)+br 3 (t)+ar 4 (t)]dt, (2.4) 

GJz)= i "[-rj(0-6r 2 (0-ar 3 (0-ar 4 (i)]^ 

+ P [-r^-br^D+ar^+anitm. (2.5) 



8 Note that the Gi(z)'s are shown by these formulas to be continuous functions. 



3. Plan of Attack 

It would not be very instructive simply to state 
the solution of the game and demonstrate its correct- 
ness. Instead, a slightly polished version of the 
process actually used to solve the game will be pre- 
sented. This is done in sections 4 and 5, in which 
it is proved (subject to the plausible assumption 
stated in sec. 4) that the only possible optimal strat- 
egy is the one obeying the conditions (a) to (d) of 
lemma 8, which deal with the behavior of the strat- 
egy on three intervals [0,r/), (g,g f ), and (g',1] which 
might be thought of as the sets of low hands, inter- 
mediate hands, and high hands. It is then relatively 
easy to verify that this particular strategy is indeed 
the unique optimal strategy; this is done in section 6, 
primarily by inspection of the expected payoff func- 
tion (2.1). In section 7 the limiting case 6=1 is 
dealt with, and this game is found to have a unique 
optimal strategy obtained 9 by "passing to the limit" 
in the optimal strategy for 6>1. The von Neumann 
poker game of sections 19.4 to 19.10 of [3] is shown 
to arise as a "reduced game" from this limiting case. 

In section 4 it is shown that r z (z) = on [0,1] for 
every optimal r(z). The points g, g' are located, and 
the unique optimal r(z) is isolated in section 5. The 
arguments of these sections are essentially all appli- 
cations of the following domination criterion: 

The strategy r(z) is optimal only if the following is 
true: For each z, i jor which Gi(z) (as a function of i) 
does not assume its minimum, rt(z) = 0. 

The correctness of this criterion follows from the 
smoothness condition (it) and the fact that r(z) is 
optimal only if 

K(r,r)=imn K(r } r*). 



4. Elimination of One Course of Action 

The elimination is effected as follows. 

Lemma 1. For every optimal r(z), r s (z) = on 

[0,1]. 

Otherwise, for some optimal r(z), R z > and 
r 3 (s)>0 for some z^>0. If z' is the least upper 
bound of the set of all such z, then by (2.4) and (2.5) 

so that by continuity and the domination criterion 
r 3 (^)=0 in a neighborhood of z', contradicting the 
definition of z r . 

The following plausible assumption (which will 
be verified in sec. 6) is now made: 

Assumption: For every optimal r(z), r 4 (l)>0 
(and thus R^> 1) . 



9 This is not quite the case for the asymmetric version :see' 11]. 
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5. Derivation of the Necessary Conditions 

Since application of the domination criterion 

requires examining the signs of the pairwise differ- 

! encesof the Gi(z)'s, the formulas for these differences 

(based on (2.2) to (2.5) and the condition r 3 (z)=0 
on [0,1 1) will first be given: 

G 2 (z)-G l (z) = (b-l)-bR 1 -2b | \ 2 {t)dt (5.1) 

Jo 

G 3 (z)-G 1 (z)=bR 2 +aR i -l-2b ( % a (t)dt 

-2a | °rS)dt (5.2) 
Jo 

G 3 (z)-G. 2 (z) = {a-b)B A -2aj\(t)dt (5.3) 
Q t (z)-G l {z)= -l-bR 2 -\ aR t -2a f \(t)dt (5.4) 

Jo 

G,(z)-G 2 (z) = 2bR 2 +(a-b)R 4t -2b J V 2 (0* 

-2aJ r 4 (*)eft (5.5) 

G,{z)-G z {z)= -2bR 2 +2b J V f (*)* (5.6) 

The first four of these continuous functions are 
clearly monotone nonincreasing. The following 
table 10 defines the maximal open intervals in which 
these functions arc positive ( -| •), zero (0), and 
negative (— ). For each interval in the (-f ) and ( — ) 
columns, the result of applying the domination 
criterion is given. Inspection of (5.1) to (5.4) 
reveals conditions on the intervals in the (0) column 

io Of course lemma l renders superfluous Uie appearance of r3(c)=0 in the table. 



which are given below the interval. A question 
mark means that the interval may (so far as is 
known at the moment) be vacuous or degenerate 
to ;i single point; the condition written below such 
an interval is to apply only if the interval is non- 
degenerate. 

The next three lemmas yield the values of AY R 2 
and i?4 for optimal r(z) (lemma 1 shows that AV 0). 

Lemma 2. For every optimal r( : \: 

(a) G 2 (0)-G 1 (0) = (b-l)-bR 1 >0. 

(b) # 4 (0)-#i(0)= -l-bR 2 -\ ah\<(). 

If (a) were false, then the monotonicity of 
G 2 (z) — Gi(z) and the domination criterion would 
imply 2?i=0, so that (a) would la* true. If (b) were 
false, then since c? 4 (l) — G 1 (\)<i0 one would have 

ft(« / )-ft(« / )=o 

for some z f in (0,1); by (5.10) n(z) = on [0,g')j so 

by (5.4) 

Gt(0)-G 1 (0) = G 4 (z')-G l (z')=0 
and (b) would be true. 

Lemma 3. For every optimal r(z): 

(a) G 2 (0)-G 1 (0) = (b-l)-bR 1 =0. 

(b) fl r 4 (0)-ft(0)= -l-MVI aR 2 =0. 

If # 2 (1) — #i(l)>0. then the monotonicity of 
G 2 (z) — Gi(z) and the domination criterion would 
imply #2=0, so that on the one hand the initial 
assumption would vend 

(b-\)-bh\>0, 



Difference 


(+) 


(0) 


(-) 


G,{z)-G,{z) 


[O,.^- 1 )? 
r 2 (z)=0 


r,(*)=0 


ri(«)=0 


GM~G l (z) 


[0,</ 3 i)? 
r 3 (z)=0 


(<M)? 

>\(z) = l 


Gf + M] 
r,(z)=0 


G 3 (z)-GM 


[0,<f?) 
r 3 (z)=0 


n(z)=o 


(<7 3 + V] 
r,(*)=0 


Gi(z)-Gi(z) 


[oyj)? 

r 4 (z)=0 


(ff 4 - 1 ,^)? 

r 4 (2) = 


ri(z)=0 



(5.7) 

(5.8) 

(5.0) 

(5.10) 



37 



and on the other hand Ei=l—R 4 (by lemma 1, 
using i? 2 =0). Combining these statements yields 
7? 4 >l/6, which together with (5.4) and R 2 =0 yields 

#4(0) -ft (0)= -l+ai? 4 >0, 

contradicting (b) of lemma 2. Thus G 2 (l) — Gi(l)<0y 
which together with (a) of lemma 2 implies that 

G 2 (z')-G 1 (z') = 

for some z r in [0,1]. By (5.7) r 2 (z) = on [0,s'); s ° 
by (5.1), 

G 2 {0)-G 1 (0) = G2(z')-G 1 (z')=0, 

and (a) is proved. If (b) were false, then by (b) of 
lemma 2, the monotonicity of G±(z) — G^z) , and the 
domination criterion, we would have R\ = 0, con- 
tradicting (a). 

Lemma 4. For every optimal r(z): 

(a) B 1 =(b-l)/b. 

(b) R 2 =(a-b)/b(a\ b). 

(c) R 4 =2/(a + b). 

This follows from lemma 3 and the corollary 
/?i+i?2+#4=l of lemma 1. 

Next the points g, </ will be located, and the 
behavior of an optimal r(z) on [0 } g), (g,g') and 
((/, 1] determined. 

Lemma 5. For every optimal r(z): 

(a) g n=<ft=g=(b-l)/b. 

(b) r,(2)=l on[0,ff). 

(c) ^=0 on (0,1]. 
By (5.5) and lemma 4, 

Gt(z)-G 2 (z)=2b ( Z r 2 {t)dt-2a f V 4 (*)<ft (5.11) 

Jo Jo 

If 0?<0¥, then by (5.7) and (5.10) one has r 2 (z) = l 
on {g 2 l <#), r 1 (z) = l on [0, .<#),. and r,(z) = on 
[0, #+), so that the last equation yields 

GM)-W1)>0; 

continuity and the domination criterion show that 
r 4 (z)=0 in a neighborhood of g+, but (5.4) and (b) 
of lemma 3 show that this contradicts the definition 
of g+. An entirely similar argument refutes the pos- 
sibility (ft<fft, and so sft=fi. By (5.7) and (5.10) 
we have r,(z) = l on [0, g^) and n(z) = on (# 2 +\ 1], 
so that g 2 l = R i ; an application of (a) of lemma 4 
completes the proof. 



Lemma 6. For every optimal r(z): 

(a) 0?=0' = (a-l)/a. 

(b) r^lwifj',1]. 

Note first that r 4 (z) = l on (eft, 1] since (i) r 2 (^) = 
on (<ff, 1] by (5.9), (ii) r l (z) = on (#, 1] by lemma 5, 
and (iii) g<Zg+ because G 3 (g) — G 2 (g)^>0 by (5.3) and 
the fact (see (5.10)) that r 4 (z) = on [0, flf). It follows 
that 



I; 



5+ r 4 (0^+(l-^ 3 + 2 )=i?4=2/(a+6). 

The definition of </+ together with (5.3) shows that 

[ 8 ^r A (t)dt=(a-b)R 4 f , 2a=(a-b)/a(a+b) ) (5.12) 

and combining the last equations yields the asserted 
value of #+. 

Lemma 7. For every optimal r(z), 

G 4 (z)-G 2 (z)=0 on[g,g']. 

Since r 2 (z) = on (#', 1], (5.11) and (5.12) imply 

G 4 (g')-G 2 (g')=2bR 2 -(a-b)R 4 =0 9 

where the last equality follows from lemma 3. Let 
2* be a point at which G 4 (z) — G 2 (z) attains its maxi- 
mum M on [g,g f ] ; the last equation shows that M> 0. 
If M>0, then by continuity and the domination cri- 
terion one has r 4 (^) = (and thus r 2 (z) = l, since 
r x (z) = on (g,l]) on some interval (z*,z'); by (5.5) 
G A (z) — G 2 (z) is strictly increasing on (2*,s')> which 
contradicts the definition of 2* unless z* = g', in 
which case M=0 by the last equation. Thus M=0; 
an entirely similar argument shows that the mini- 
mum value of G±(z) — G 2 (z) on [g,g'\ is m=Q, and so 
the lemma is proved. 

The completion and summary of the preceding 
material is as follows. 

Lemma 8. Let g=(b—l)/b and g / =(a—l)/a. If 
the assumption at the end oj section 4 is correct, then 
the only possible optimal strategy is the strategy defined 
by 

(a) n{z) = l on[0,g). 

(b) r 2 (z)=a/(a+b) on {g,g'). 

(c) r 4 (z)=b/(a+b) on (g,g'). 

(d) r 4 (s)=l onfo',1]. 

From lemma 1, lemma 5, and lemma 6 one has (a), 
(d), and also 



r 2 (z) J rn(z) = l on (g,g'). 



(5.13) 
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From lemma 7. (5.5), and lemma 4 one has 
\br 2 (t)-ar 4 (t)]dt=0 on (g,g'), 



. 



which by smoothness condition (in) (see section 2) 
implies 

br 2 (z) — an(z)=0 on (g,g f ); 
this and (5.13) imply (b) and (c). 

6. Proof That the Conditions Are Necessary 
and Sufficient for Optimality 

In this section the notation r(z) is reserved for the 
particular strategy described in lemma 8, while arbi- 
trary strategies are denoted /•*(,:). 

Lemma 9. K(r } r%)=rnm K(r,r*) if and only if 

r*(z) obeys 

(a) r%(z) on [0,#). 

(b) r * l (z)=rt(z) = on (g,g f ). 

(c) r?(2)= /•?(:) on (gr',1]- 

The K(r,r*) of (2.1) is first written as the sum of 
these integrals: 



i: 
j; 



[e 1 (3)+r?( 2 )(« 2 (0)-« 1 ( 2 ))+r?( 2 )((?3(2) 

-e i (^))-|-r?(^(^(^)-^(.:))k/.:. 

[G 2 (z)+rUz)(6 1 (z)-G 2 (z))+rt(z)(G,(z) 

-G 2 (z))+r*(z)(G 4 (z)-G 2 (z))]dz. 

[0.(z) + rUz) (G 1 (z)-G 4 (z)) +r?(3) (£ 2 C?) 

-^ 4 (^)) +r*( -) «7 3 (£') -^(.3)) |,/ : . 



It is easily verified that r(z) has the properties 
described in lemma 4, and since r z (z) = on [0,1], 
r(z) obeys (5.1) — -(5.6). Using these facts, oik 4 finds 
that on [0,g) the coefficients of r*(z) and r*(z) in the 
first integral vanish, while that of r*(z) is positive. 
Hence r*{z) minimizes the first integral if and onlv 
if it satisfies (a). Similarly, r%(z) minimizes the 
second integral if and only if it satisfies (l>), and 
minimizes the third integral if and onlv if it satisfies 
(c). 

Theorem. r(z) is the uniqm optimal strategy. 

Since r(z) obeys (a) to (c) of lemma 9, 
K(r,r)= min K(r>r*) 



which (since the game is symmetric) implies that 
r(z) is optimal. Thus any optimal r*(z) must satisfy 

K(rj*)-m\n K(r,r*) 

r* 

and thus, by lemma 1 and (c) of lemma 9, must. 
satisfy r*(l) = l. Hence the assumption at the end 
of section 4 is correct, and so, by lemma 8, r(z) is 
the only optimal strategy. 

7. Discussion of a Limiting Case 

The limit ing case b = 1 will now be briefly discussed. 
(Recall that 6>1 previously.) It is easily verified 
that the results of the previous sections remain 
valid (many of the proofs can be considerably simpli- 
fied), so that the game with b=l has a unique opti- 
mal strategy which is obtained from that for 6>1 
(i.e., from (a) to (d) of lemma 8) by setting 6 = 1. 
Since g = for 6=1, the unique optimal strategy 
is given by 

(a) r 2 (z)=a/(a+l) on [0,(a-l)/a). 

(b) r 4 (z) = l/(a+l) on [0,(<z-l)/a-) 

(c) /-,(.:) = 1 on ((a-l)/fl,l]. 
Upon changing 

player R, player S, r 2 , r 3 , r 4} a 

to player L, player 2, p 3 , p 2 , pi, a/b 

respectively, one finds that this strategy is precisely 
the unique optimal strategy of the continuous von 
Neumann poker game discussed in sections 19.4 
to 19.10 of [3). To explain this, note thai (2.2) and 
(2.3), with 6=1, yield 



f 

Jo 



Q i {z)-6 l {z) -h\-2\ [r 2 (f)+r 3 (t)]dt<0 
Jo 

for all strategies r(z), which on the one hand shows 
that the first course of action is dominated, and on 
the other hand yields a quick proof 11 that n(z) = 
on [0,1] for all optimal r(z). Thus, without loss of 
optimal strategies, the game can be "reduced" by 
"striking out" the first course of action for both 
players. This reduced game is easily seen to be 
identical with the von Neumann game, except for 
some changes of language and a multiplicative factor 
6 in the expected payoff function. Thus the von 
Neumann game appears as a reduced form of the 
limiting ease b= 1 of the game solved in t his paper. 



11 If nU')>0 for some z', then /?i>0; the formula and the domination criterion 
yield ri(z)=0 for all z, a contradiction. 
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